recsy challenge
EB-NeRD: A Large-Scale Dataset for News Recommendation
Kruse, Johannes, Lindskow, Kasper, Kalloori, Saikishore, Polignano, Marco, Pomo, Claudio, Srivastava, Abhishek, Uppal, Anshuk, Andersen, Michael Riis, Frellsen, Jes
Personalized content recommendations have been pivotal to the content experience in digital media from video streaming to social networks. However, several domain specific challenges have held back adoption of recommender systems in news publishing. To address these challenges, we introduce the Ekstra Bladet News Recommendation Dataset (EB-NeRD). The dataset encompasses data from over a million unique users and more than 37 million impression logs from Ekstra Bladet. It also includes a collection of over 125,000 Danish news articles, complete with titles, abstracts, bodies, and metadata, such as categories. EB-NeRD served as the benchmark dataset for the RecSys '24 Challenge, where it was demonstrated how the dataset can be used to address both technical and normative challenges in designing effective and responsible recommender systems for news publishing. The dataset is available at: https://recsys.eb.dk.
Many-to-one Recurrent Neural Network for Session-based Recommendation
Dadoun, Amine, Troncy, Raphael
This paper presents the D2KLab team's approach to the RecSys Challenge 2019 which focuses on the task of recommending accommodations based on user sessions. What is the feeling of a person who says "Rooms of the hotel are enormous, staff are friendly and efficient"? It is positive. Similarly to the sequence of words in a sentence where one can affirm what the feeling is, analysing a sequence of actions performed by a user in a website can lead to predict what will be the item the user will add to his basket at the end of the shopping session. We propose to use a many-to-one recurrent neural network that learns the probability that a user will click on an accommodation based on the sequence of actions he has performed during his browsing session. More specifically, we combine a rule-based algorithm with a Gated Recurrent Unit RNN in order to sort the list of accommodations that is shown to the user. We optimized the RNN on a validation set, tuning the hyper-parameters such as the learning rate, the batch-size and the accommodation embedding size. This analogy with the sentiment analysis task gives promising results. However, it is computationally demanding in the training phase and it needs to be further tuned.
Two Stages Approach for Tweet Engagement Prediction
Dadoun, Amine, Harrando, Ismail, Lisena, Pasquale, Reboud, Alison, Troncy, Raphael
This paper describes the approach proposed by the D2KLab team for the 2020 RecSys Challenge on the task of predicting user engagement facing tweets. This approach relies on two distinct stages. First, relevant features are learned from the challenge dataset. These features are heterogeneous and are the results of different learning modules such as handcrafted features, knowledge graph embeddings, sentiment analysis features and BERT word embeddings. Second, these features are provided in input to an ensemble system based on XGBoost. This approach, only trained on a subset of the entire challenge dataset, ranked 22 in the final leaderboard.
A Short History of the RecSys Challenge
Said, Alan (University of Skövde)
Today, even though similar approaches are in use, they are usually just one part of complex recommendation approaches that can include large collections of algorithms and data sources. The data set was again provided year that the summer school on Recommender Systems by Moviepilot and was co-organized by TU Berlin. By 2007, the Netflix Prize had The second track focused on recommendation of scientific attracted thousands of participating teams, and the papers. The challenge attracted 30 participating Netflix Prize concluded. At the by Simon Fraser University and Yelp who also 2010 ACM RecSys conference, the seed for what provided the data. CAMRa attracted a moderate the 2014 challenge did not focus on classical recommendation, number of participants, but contributed to establishing but rather on prediction of user engagement, the RecSys Challenge series.